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PERMUTATIONS FIXING A k- SET 


SEAN EBERHARD, KEVIN FORD, AND BEN GREEN 


Abstract. Let i(n,k) be the proportion of permutations n € S n having an invariant set 
of size k. In this note we adapt arguments of the second author to prove that i[n, k ) x 
k~ s ( 1 + log A:) -3 / 2 uniformly for 1 ^ k ^ n/ 2, where <5 = 1— 1+1 I °| * l ° s2 . As an application 
we show that the proportion of 7r £ S n contained in a transitive subgroup not containing 
A n is at least n~ s+o( ' 1 ' > if n is even. 


1. Introduction and notation 


Let k,n be integers with 1 ^ k ^ nj 2 and select a permutation tt e S n , that is to say a 
permutation of {1,... , n}, at random. What is i(n, k), the probability that ir fixes some set 
of size k? Equivalently, what is the probability that the cycle decomposition of n contains 
disjoint cycles with lengths summing to k? 

Somewhat surprisingly, i(n, k) has only recently been at all well understood in the pub¬ 
lished literature. The lower bound lim., woo i(n, k) log k/k is contained in a paper of 
Diaconis, Fulman and Guralnick [DFG08] . while the upper bound i(n,k) <C A: -1 / 100 may be 
found in work of Luczak and Pyber [LP93j . (These authors did not make any special effort 
to optimise the constant 1/100, but their method does not lead to a sharp bound.) Here and 
throughout A" <V Y means X CY for some constant C > 0. The notation X x Y will 
be used to mean X <^Y and X » V. In the limit as n —» oo with k hxed, a much better 
bound was very recently obtained by Pemantlc, Peres, and Rivin [ PPR , Theorem 1.7]. They 
prove that lim^oo i(n, k) = k~ s+o( ' 1 \ where 


6 


1 - 1 + 1 ° gl ° g2 « 0.08607. 
log 2 


They also note a connection between the problem of estimating i(n, k) and a certain number- 
theoretic problem, an analogy that will be also be key to our work. The same connection 
has also been observed by Diaconis and Soundararajan |Soul3] page 14]. 

Let us explain the connection with number theory. There is a well known analogy (see, for 
example, [ AET9 7]) between the cycle decomposition of a random permutation and the prime 
factorisation of a random integer. Specifically, if 7r is a random permutation with cycles of 
lengths ai < a 2 < .. •, and if n is a random integer with prime factors Pi < P 2 < ■ ■ ■ then one 
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expects both sequences log ai, log 02 ,... and log log pi, log log p 2 ,... to behave roughly like 
Poisson processes with intensity 1. (Of course, this does not make sense if taken too literally, 
since the a, are all integers, and the p t are all primes, plus we have not specified exactly 
what we mean by either a “random permutation” or a “random integer”.) The condition 
that + ■ ■ ■ + a im = k (that is, that a particular set of cycle lengths sum to k) is, because 
the at are all integers, equivalent to k ^ • - + aj m < k + 1. Pursuing the analogy between 

cycles and primes, we may equate this with the condition k ^ logp^ + • • - + log pi m ^ k +1 , or 
in other words e k ^ p ^ • • • p irn ^ e k+1 . This then suggests that we might compare i(n, k) with 
i(n,k ), the probability that a random very large integer (selected uniformly from [e n ,e n+1 ), 
say) has a divisor in the range [e k ,e k+1 ). 

This last problem has a long history, originating as a problem of Besicovitch [ Bes34j in 
1934, and was solved (up to a constant factor) by the second author |For08al lFor08bj . In 
those papers it was shown that i(n, k) x k~ 5 (l + log k)~ 3 ^ 2 uniformly for k ^ n/ 2, where 5 
is the constant mentioned above. In this paper we use the same method to prove the same 
rate of decay for i(n, k ). 

Theorem 1.1. i(n,k) x k~ s ( 1 + log fc) -3 / 2 uniformly for 1 ^ k ^ n/2. 

Since i(n,n — k) = i(n, k ), Theorem 11.11 establishes the order of i{n, k) for all n, k. 

Theorem 11.11 has implications for a conjecture of Cameron related to random generation 
of the symmetric group. Cameron conjectured that the proportion of 7r G S n contained 
in a transitive subgroup not containing A n tends to zero: this was proved by Luczak and 
Pyber jLP93] using their bound i(n,k ) k ^ 1 / 100 . Cameron further guessed that this pro¬ 
portion might decay as fast as ?7r 1//2+0 ( 1 ) (see |LP931, Section 5]). However Theorem 11.11 has 
the following corollary. 

Corollary 1.2. The proportion of n G S n contained in a transitive subgroup not containing 
An is n 5 (logn) 3 / 2 , provided that n is even and greater than 2. 

Proof. By Theorem ll. li the proportion of 7r G S n fixing a set Bi of size n/2 is x n _<s (log n) -3 / 2 . 
Such a permutation 7r must also fix the set B 2 = {1,... ,n} \ Bi, and thus preserve the 
partition {Bi, B 2 } of {1,... ,n}. Since \Bi\ = \B 2 \, the set of all r preserving this partition 
is a transitive subgroup not containing A n . □ 

We believe that a matching upper bound 0(n _<5 (logn)~ 3 / 2 ) holds in Corollary 11.21 and 
that for odd n there is an upper bound of the form 0(n~ s ) for some 5' > 5. We intend to 
return to this problem in a subsequent paper. 

Whether or not a permutation 7 r has a fixed set of size k depends only on the vector 
c = (ci(7r), c 2 (7t), ..., Cfc(7r)) listing the number of cycles of length 1, 2,..., k, respectively, 
in n. Crucial to our argument is the well known fact (see, e.g., All41) 7 ) that for fixed k , 
c has limiting distribution (as n — > 00 ) equal to = (Ad, A" 2 ,..., Ad), where the X t are 

independent and Xj has Poisson distribution with parameter 1/i (for short, A \ = Pois(l/z)). 
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A simple corollary is that the limit i(oo,k) = lim n _ ) . C)0 i(n, k) exists for every k. Define, for 
any finite list c = (ci, c 2 ,..., c k ) of non-negative integers, the quantity 

££ (c) = {mi + 2 m 2 + • • • + km k : 0 ^ rrij ^ Cj for j = 1 , 2 ,...,/;}. (1.1) 

We immediately obtain that 

i(oo,lfe) = P(/c G 2gf(X fc )). (1.2) 

This makes it easy to compute i(oo,k) for small values of k. For example we have the 
extremely well known result (derangements) that 

i( oo, 1) = P(Xi ^ 1) = 1 - - « 0.6321, 

e 

and the less well known fact that 

/(oo, 2) = 1 - P(W = X 2 = 0) - P(Ad = 1, X 2 = 0) = 1 - 2e _3/2 « 0.5537. 

When k is allowed to grow with n, the vector c is still close to being distributed as X*,, 
the total variation distance between the two distributions decaying rapidly as n/k oo 
[AT92| . This fact is, however, not strong enough for our application. We must establish an 
approximate analog of (11. 2 j) . showing that i(n, k) has about the same order as P (k € 2zf(X k )), 
uniformly in k ^ n/2. 

Instead of directly estimating the probability of a single number lying in Sf(X k ), how¬ 
ever, we apply a local-to-global principle used in [ForOSal ForOSbj to reduce the problem to 
studying the size of Sf(X k ). We expect a positive proportion of the elements of 2zf(X k ) to 
lie in the range [A/c, 10 k\ (say). The reason for this is that we expect to find ~ 1 index j 
for which Xj > 0 in any interval [e*,e* +1 ]. In particular, it is fairly likely that there is some 
such j with j > k/10 , in which case at least half of the sums mi + 2m 2 + • • • + km k will be 
^ /c/10 (those with rrij > 0), yet at the same time it is reasonably likely that all elements 
of Jzf(Xfc) are < 10 k. Assuming this heuristic is reasonable, we might expect that 

i(n, k ) x P (k e Se{X k )) x jE\J?(X k )\. (1.3) 

In Section El we will show that (jl.3j) does indeed hold. The main result of that section is 
the following. 

Proposition 1.3. i(n, k ) x -j;E|2zf(X fe )| uniformly for 1 ^ k ^ n/2. 

Our main theorem follows immediately from this and the next proposition, whose proof 
occupies Sections 0] (lower bound) and [5] (upper bound). Note that in these propositions we 
operate with the sequence X k = (X 1; X 2 , ..., X k ) of genuinely independent random variables, 
which is independent of n. 

Proposition 1.4. E|2z?(Xfc)| x AW 5 (1 + log k)~ 3 ^ 2 . 

To briefly explain the origin of the exponent <5, we first observe the simple inequalities 

|2Sf(X fc )| ^ min (2 Xl+ - +Xfc , 1 + X 1 + 2X 2 + • • • + kX k ) . (1.4) 
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Assume this is close to being sharp with reasonably high probability, and condition on 
Y — X 1 + ■ • • + X k , the number of cycles of length at most k in a random permutation. 
Following our earlier heuristic, the second term on the right side of fll.4)1 is x k most of the 
time, and so there is a change of behaviour around Y = + 0(1). Since Y is Poisson with 

parameter log k + 0(1), a short calculation reveals that Emin(2 5 , k) x A; 1-5 (log A;) -1 ' 2 . We 
err in the logarithmic term due to the fact that (jl.4[l is only sharp with probability about 
1/logfc, a fact that is related to order statistics |For08bl Sec. 4], 

Let us finally mention two open questions. 

Question 1. Is there some constant C such that z(oo, k) Ck~ 5 (log k)~ 3 / 2 ? 

It would be surprising if this were not the case. 

Question 2. Is i(oo, k) monotonically decreasing in kl 

Data collected by Britncll and Wildon [ BWj shows that this is so at least as far as i(oo, 30), 
and of course a positive answer is plausible just from the fact that «(oo, k) —* 0. 


2. A PERMUTATION SIEVE 


As mentioned in the introduction, the asymptotic distribution (as n —> oo with k fixed) of 
the cycle lengths (ci(7r),..., c*;(7r)) of a random 7r G S n is that of X k = (X \,..., X k ), where 

the Xi are independent with Xf = Pois(l/i). In the nonasymptotic regime, where n may be 
as small as 2k, this property is lost. We do, however, have the following substitute which 
will suffice for this paper. 


Proposition 2.1. Let 1 ^ m < n and C\,, c m be non-negative integers satisfying 

Ci + 2 c 2 + • • • + mc m ^ n — m — 1. 

Suppose that tt e S n is chosen uniformly at random. Then 

^ P(ci(7r) = Cl, . . . , C m (7r) = C m ) ^ 


(2m + 2) n™ i CiH Ci 


(m + 1) nr=i 


We will prove this shortly, but first let us fix some notation. As every permutation 7r G S n 
factors uniquely as a product of disjoint cycles, in keeping with the analogy with integers 
we say that any product of these cycles, including the empty product, is a factor or divisor 
of 7T. The sets induced by these factors are precisely the invariant sets of 7r. We make the 
following further definitions: 

• Ck, n is the set of cycles of length k in S n ; 

• | a | is the length of any factor cr (of some permutation in <S n ); 

• t\tt means that r is an invariant set or divisor of n. 

The following lemma is a slight generalization of the well known formula of Cauchy. 
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Lemma 2.2. Let 1 ^ m ^ n, and let c ±,..., c m be non-negative integers with t — c\ + 2c2 + 
• —b mc m ^ n. Then the number of ways of choosing c\ + • —f c m disjoint cycles consisting 
of Ci cycles in C i;n for 1 ^ i ^ m is 


(■n — t )! 


m 


n 


i 


Proof. First count the number of ways of choosing the subsets that make up the cycles, and 
then multiply by the number of ways to arrange the elements of these subsets into cycles. 
The result is 



Cl C2 Cm 


which simplifies to the claimed expression. 


□ 


Our next lemma is an analogue for permutations of a basic lemma from sieve theory. 

Lemma 2.3. Suppose that m,n are integers with 1 ^ m ^ n. Let n E S n be chosen 
uniformly at random. Then 

-^ P(7r has no cycle of length < m) ^ —. 

2 m m 

Remarks. Both upper and lower bounds are best possible, since trivially the probability 
in question is exactly 1/n when n/2 < m ^ n (if a permutation has no cycle of length < m, 
with m in this range, then it must be an n-cycle). In fact, it is not difficult to prove an 
asymptotic formula ~ c o(n/m)/m (n —> oo, m —> oo, m ^ n) for the probability in question, 
where u> is Buchstab’s function and uj(u) —> e -7 as u —> oo [Gra061 Theorem 2.2], 


Proof. (See the proof of (Gra06l . Theorem 2.2]). We phrase the proof combinatorially rather 
than probabilistically; thus let c(n,m ) be the number of permutations of S n that have no 
cycles of length < m. We proceed by induction on n, the result being trivial when n — 1. 
Let ^2* denote a sum over permutations with no cycle of length < m. Using the fact that 
the sum of lengths of cycles in a permutation in S n is n, we get 


£‘«=£* E m = 5>E E" 1 


7 rG«S 

n 7 tGS n Cr|7T 

a a cycle 


<t\tv 


y. k y c{n - k, m) + y 

m^k^n—m crECk n 


n 


( jE.Cn 


= n 


'+ E 


n\ 


m^k^n—m 


(n — k)\ 


c{n — k, m). 










6 


SEAN EBERHARD, KEVIN FORD, AND BEN GREEN 


If | < m ^ n, then c(n, rn) — — and the result follows. Otherwise, by the induction 
hypothesis, 


nc[n 


. , u: n — 2m+l\ n\ ■ n 

, m) ^ n\ + 2^ — = n\ I 1 +- 1 ^ 


m<k<n—m 


m 


m 


and 


nc[n 


. . \—> n\ I n — 2m + 1\ ^ n\ ■ n 


m^k^n—m 


2m 


2m 


□ 


It is now a simple matter to establish Proposition 12.11 


Proof of Proposition lg.il Let t = c\ + 2c2 + • • • + mc m . For each choice of the c\ + • • • + c m 
disjoint cycles consisting of Cj cycles from C j n (1 ^ j ^ m), there are c(n — i, rn + 1) 
permutations 7r G S n containing these cycles as factors and no other cycles of length at most 
m, where c{n — t, m + 1) is the number of permutations on n — t letters with no cycle of 
length < m + 1, as in the proof of Lemma [2.31 Applying Lemmas 12.21 and 12.31 completes the 
proof. □ 


3. The local-to-global principle 


As in the introduction, let Ad, X 2 ,... be independent random variables with distribution 
Xj = Pois(l/j). We record here that 

E|JSf(X fc )| = Y, |^(c)|P(Ad = Cl )---P(AA = c fc ) = e-^ V J. f^ 1 , (3.1) 

cx. C k >0 c 1 ,...,c k >on i =l*h Ci 

where hk = 1 + \ + • • • + p- We also record the inequalities 

log (A; + 1) ^ h k ^ 1 + log A;, (k ^ 1) (3.2) 

which may be proved, for example, by summing the obvious inequalities ^ f™ +1 dt/t ^ A 
Lemma 3.1. Let k G N, ci,..., Ck ^ 0, / C [A;] and c[ = c, for i I, c\ = 0 for i G I. then 

1^(01 W) inc + i). 

iei 


Proof. Clearly, Af(c) is the union of n«e/( c * + 1) translates of Jf(c'). □ 

Lemma 3.2. Suppose that P ^ t. Then 


1e|jz?(x ( )| < 1e|^(x,,)|. 
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Proof. By Lemma HOI |J??(X^)| ^ (1 + Xg/ + \) •••(! + Xg)\Jf (X^)|. Thus by independence, 

E|JSf(X £ )| ^ ( n E(l +W))e|^(X,,)| = ^E|jgf(x^)| < |e|2S?(X,,)|. □ 

' i=l '+1 ' 

We also need to compute the mixed moments of |Jz?(Xfc)| with powers of some X 3 . Recall 

that the mth moment EX m , if X = Pois(l), is the mth Bell number B m . The sequence of 
Bell numbers starts 1, 2, 5,15, 52, 203,.... 

Lemma 3.3. Suppose that ji,... ,jh ^ k are distinct integers and that cq,..., are positive 
integers. Then 

E\S?(X k )\X£...X% ^ ^l^E|j5f(X fc )|. 

We may take C ai , -Ah — nil (An + B ai+ 1 ). In particular we may take C\ — 3. 

Proof. Define X' fc by putting X' ]{ = • • • = X'- h = 0 and X'- = X 3 for all other j. By 
Lemma 13. 11 we have 

l^(x*)| < |jgf(x;)|(i + x n ) ■ ■ ■ (i + x jh ). 

Thus by independence 

h 

F.\J?(X t )\X‘; ■ ■ ■ X% $ E|JSf(X'*)| ReXJ + EX£»). (3.3) 

2—1 

For X = Pois(A) we have EX m = </> m (A), where </> m (A) is the m-th Touchard (or Bell) 
polynomial, a polynomial with positive coefficients and zero constant coefficient. If A ^ 1, 
it follows that EX m ^ A B m for m ^ 1. The result follows immediately from this, (13.31) . and 
the observation that E|jSf(X^.)| ^ E|2z?(Xfc)|. □ 

We turn now to the proof of Proposition 11.31 In what follows write S(X.g) = X\ + 2A" 2 + 

• • • + IXg = max 25? (X^). We will treat the lower bound and upper bound in Proposition II.31 
separately, the former being somewhat more straightforward than the latter. 

Proof of Proposition ITT! (Lower bound). If k < 40 then i(n, k ) x i(oo, f)xlx AE|25f(X fc )|, 
so we may assume k ^ 40. Let r = \k/2t)\ (so r ^ 2), and consider the permutations 
7T = aaia 2 /3 G S n , where cu and a 2 are cycles, |a| ^ 4r < |oq| < |cr 2 1 < 16r, all cycles in a 
have length ^ r, all cycles in /3 have length at least 16r, and cxoqoq has a fixed set of size 
k. Because of the size restrictions on a, oq, cr 2 , if a is of type c = (<q,..., c r ), with q cycles 
of length i for 1 ^ i ^ r, then the last condition is equivalent to k — |<7i| — |cr 2 | G 2z?(c). In 
particular |cq| + |cr 2 | ^ k, and hence n — |a| — |<7i| — |cr 2 | ^ ^ 16r. Fix c and I\,I 2 with 

Ar < fi < l 2 < 16 r such that k — i\ — I 2 G 2z? (c). By Proposition 12.11 the probability that 
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a random tt G S n has c* cycles of length i (1 ^ i ^ r), one cycle each of length £i, i 2 and no 
other cycles of length < 16r is at least 

1 1 

32r^4 nr=i ^ 2 is r 3 ni=i 

For any l\ satisfying 4r + 1 ^ l\ ^ 8r — 1, there are |25?(c)| admissible values of f 2 > t\ for 
which k — — i 2 G 2z?(c), since max 25? (c) ^ 4r ^ k/5. We conclude that 


i(n, k ) ^ 


4r — 1 

213^,3 


X 


|^(c)| 


ci,”* ,c r >0 
S'(c)^4r 


n 


i =1 


Ci'.l 


\i c i 


As in (13.11) . the sum above equals e fer E|25?(X r )|l,s(x r )s;4r- Hence, by (13.21) . we see that 

i(n,k) ^ ^ r E|if(X r )|l S(Xr )^4r- 
To estimate this, we use the inequality 


By Lemma [3.31 we have 


ls(X r )<4r ^ 


g(X r ) 

4r 


E|25?(X r )|S(X r ) = ^E|25?(X r )|jXj < 3rE|25?(X r )|. 

3 = 1 


It follows that 

>-b E |. 2 >(X r )|. 

Finally, the lower bound in Proposition 11.31 is a consequence of this and Lemma 13-dfl . 


□ 


Proof of Proposition [~Q1 (Upper bound). Temporarily impose a total ordering on the set of 
all cycles Ck, n , first ordering them by length, then imposing an arbitrary ordering of 
the cycles of a given length. Let i r G S n have an invariant set of size k. Let k\ — k and 
k 2 — n — k. Then i r = nin 2 , where 7 Tj is a product of cycles which, all together, have total 
length kj, for j = 1, 2. For some j G {1, 2}, the largest cycle in 7r, with respect to our total 
ordering, lies in 7r 3 _j. Let cr be the largest cycle in ttj, and note that |cr| X min (Ad, k 2 ) = k. 
Write 7r = aa/3, where a is the product of all cycles dividing 7r which are smaller than cr 
and (5 is the product of all cycles which are larger than cr. In particular \/3\ 2s |cr| since (3 
contains the largest cycle in tt as a factor, and thus |cr| $2 \fi\ = n — |cr| — |a|. 

By definition of cr and a, aa has a divisor of size kj. Suppose \cr\= i and c = (ci, c 2 ,..., cf) 
represents how many cycles a has of length 1,2,... respectively. Then kj — i G 25?(c). 

3 Strictly for the purposes of proving our main theorem, this appeal to Lemma |3.21 is unnecessary. However, 
that lemma is straightforward and it is more aesthetically pleasing to have E|2z?(Xfc)| in the lower bound for 
i(n,k) rather than E|j£?(X r )|. 
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For £ and c satisfying this last condition, the number of possible pairs a, a is at most (by 
Lemma [ 


n\ 


n 


X 




n\ 


(n — |a| — |cr|)! LL q! i Ci (q + l)\£ c i+ l £(n — |a| — |cr|)! d\i Ci 


n 


i<e 


Given a and cr, since |<r| ^ n — |a| — |cr|, Lemma [2.31 implies that the number of choices for 
/3 is at most (n — |a| — |cr|)!/|cr|. Thus 

2 


■(".‘KEEs e n 1 


£ 2 

j =1 £= 1 ci,...,C£^0 

kj-£eS?( c) 


Ci'.l 


e e n 

j =i c!,...,c fc yo iyfc 


c,;! i 


E 


kj-ie&lc) 


£ 2 ’ 


where m(c) = max{i : c* > 0} U {1}. With c fixed, note that l ^ max(m(c), k 3 — S( c)). 
Also, the number of £ such that kj — £ £ J 2 ?(c) is at most | 22 ?(c)|. Thus, the innermost sum 
on the right side above is at most 

l-g(c)| 

max(m(c), kj — 5(c)) 2 
Like (13.11) . using (j3.2j) we thus see that 

l^(X*)| 


i(n , fc) ^ 2e/::E 

To bound this we use the inequality 


max(m(X fe ), k - S(X k )) 2 ' 


C2 


(3.4) 


max(m, k — S) 2 fc 2 

which can be checked in the cases S ^ k/2 and S ^ k/2 separately. It follows from this and 
that 


i(n, k) < 8e^E|if (X,) | + Se^E 1 ^^^™ 2 . 
k k m(X+ 


(3.5) 


The hrst of these two terms is what we want, but the second requires a keener analysis. 
By conditioning on m = m(X*.) we have 


E 


j-ggfcMXfc) 

m(X k y 


2 k ^ 


E — E l^(c)|5(c) 2 P(X m = c)P(X m+ i = • • • = X k . = 0) 


m 

771=1 Cl,...,C m ^0 


E-EY m S(X m ) 2 l Xm)1 exp - V - 

' m z V 7 


m=l 


J =777+1 


ft* 1 

k m 

777=1 
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Here we have written Y m = |j£f(X m )| for brevity, and in the last step we used the crude 
inequality lx m ^i ^ AT m . Expanding S(X m ) 2 = (X\ + 2X 2 + • • • + mX m )' 2 and using (13 .5 p . 
we arrive at 

1 1 k 1 m 

i(n , k) « ±E |Sf (X fc ) I + p ^ - XI ™'EY m WXA- (3.6) 

m=l j,j'=l 

The innermost sum is estimated using Lemma 13.31 splitting into various cases depending on 
the set of distinct values among 

Case 1: all distinct. Then ii , KY m X i X i rX m ^ c ’ 1 ^’ 1 EY m = ^EY m . 

Case 2: i = i' ± m. Then ii r EY m X i X v X m < EY m sC Ci j2 EY m = 21EY m . 

Case 3: i — i' — m. Then ii'FY m XiXi'X m ^ C^mEY m = 20mEY m . 

Case 4: i ^ i! = m or i 1 ^ i = m. In both cases n'KY m XiXi/ X m ^ 21EY, n . 

Summing over all cases, it follows that 

m 

ii'EY m X i X il X m < rnEY m . 

Since clearly EY m ^ EY k for every m ^ k the result follows from this and (13. 6p . □ 


4. The lower bound in Proposition 11.41 


In this section we prove the lower bound in Proposition 11.41 and hence the lower bound 
in our main theorem. We begin by noting that from (13. lj) and (13.2p follows 


E|js?(x fc )| 




ek ^ rr fc Ci \i ci ’ 

ci,...,c k ^0 ilj=i * 


(4.1) 


If we fix r = Ci + • 
permutation, then 


+ Cfc, which we may think of as the number of cycles in a random 

y^ l°^( c )l _ y^ l°^*( a )l 2 ) 


where 

-^*(a) = {E a < :/ c[r]}. (4.3) 

iei 


The equality is most easily seen by starting from the right side and setting c t — \ {j : a 3 — i } | 
for each i: then 2z?(c) = Jf*( a), J([ i=1 ^ Ci = a i‘'' a k, and each c = (c 1; ..., c*) comes from 
ci!- r ; i different choices of a 1; ..., a&. One may think of a 1; ..., a r as the (unordered) cycle 
lengths in a random permutation, in this case conditioned so that there are r total cycles. 
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Now let J = 


log k 
iog2 


and suppose that bi,... ,bj are arbitrary non-negative integers with 


sum r. Consider the part of the sum in which 


2*—1 

bi = c i (* = 2 ’''' ’ J )’ C = 0 (j > 2 J - 1). 

i=2 i-i 


Equivalently, suppose there are exactly 5* of the dj in each interval [2* 1 ,2* — 1], Writing 
0 (b) = nii(2 i - 1 , • • •, 2 f - 1} 6 % we have 


}_ l°^*( a )l 

r! ai ■ ■ • a r 

CL\ j . . . j— 1 


6 X ! - - - 6j! ^ 

bi,—,bj J dG^(b) 


l^(d)l _ 

di ■ ■ ■ d r 


(4.4) 


To see this, fix bi,... ,bj and observe that there are bi \ r '\ bj i ways to choose which bi of the 
variables ai,... ,a r lie in [2* _1 , 2* — 1] for 1 ^ i ^ J. 

Combining (14.ip . (14.21) and il 1. II) gives 


MUI»T E 


& 1 -I-h bj=r 


■V- 


E 

deg>(b) 


l^*(d)| . 

d\ - ■ ■ d r 


Thus in particular one has 


E|JSf(X*)| » 


\ E 

bi~\ -h bj—J 



E 

de^(b) 


d\ - ■ ■ dj 


(4.5) 


(This may seem wasteful at first sight, but in fact a more careful - though unnecessary - 
analysis would reveal that the main contribution is from r = J + 0(1), so this is not in fact 
the case.) In the light of this, the motivation for proving the following lemma is clear. 

Lemma 4.1. For any b = (6 1; ... ,bj) with bi + • • • + bj = J we have 


E 

de®(b) 


l^*(d)| 

d\ ■ ■ ■ dj 


> 


(2 log 2)- 


T.U 2 fa 


H- \-bi~i 


Proof. Given feN, let R( d, 7) be the number of 7 C [ J] with 7 = 7 dj (One should think 

of the number of cycles with lengths summing to precisely l in a random permutation.) 
Then ^i?(d, 7) = 2 J . Also, define \ = V.) for 1 ^ i ^ J (thus X t ~ log 2). By 
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Cauchy-Schwarz, 






£ S)(£ 

de®(b),e J/ V dG^(b) 


]^(d)l \ 

d\ ■ ■ -dj )' 


(4.6) 


Our next aim is to establish an upper bound for the first sum on the right side. We have 


£ 

de&(b),e 


R( d,l) 2 

d\ ■ ■ ■ dj 


£ s(h,h), 

h,h<z[J] 


where 


(4.7) 


Sih.h) 


£ 

dG^(b) 

£ie/l ^* = £ig/ 2 


1 

d\ ■ ■ ■ dj 


If I\ = I 2 , then evidently S(Ii,I 2 ) = A)' 1 • • • X b / . If Ji and I 2 are distinct, let j = 
max(/iA/ 2 ) be the largest coordinate at which I\ and I 2 differ. With all of the quantities d* 
fixed except for dj, we see that dj is uniquely determined by the relation ^2 ieI dj = ^2 ieI 
If we dehne e(j) G [J] uniquely by 


bi + • • • + 5 e (j)-1 + 1 ^ j ^ bi + • • • + &e(i), 

then dj ^ regardless of the choice of A,..., dj- 1 , dj+i, _ , dj and thus 


S(h,I 2 )< n<£x 


i= 1 


dj 2 e (i)-i 


A? 1 • • • A 


X- 1 
j Mi) 


2 e 0)— 1 


V ■ ■ ■ X) J 

2 e (i) 


(Here, the sums over d t are over the appropriate dyadic intervals required so that d G ^(b).) 
Here we used the fact that A* x 1; in fact one may note that X t ^ Aj + i for all i (since 
A ^ W + inTl) an d that lim^oo A* = log 2, so in fact A* ^ log 2 for all i. 
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Since the number of pairs of subsets I\,l 2 C [J] with max(/iA/ 2 ) = j is exactly 2 J+J 1 , 
we get from this and (14.711 that 

II ^ ^ d ’ ^ < 2 J + 22 i_e(j) = 2 J + 2 ' 1 2 _i 2j 


j =i de^(b)y 


j=i 

j 


* =1 j'-e(j)=i 


■+bi—i 


< 2 J + 2 J ^ 2 6l+ ‘ H 
i=l 
J 

•C 2 J 2 bl+ " H 
2=1 

Comparing with (14.6H . and using again that A* ^ log 2, completes the proof. 


■+bi—i 


□ 


Combining Lemma [4. II and (14.5(1 . we obtain 


E|2S?(X fc )|» 


(2 log 2) J 
k 


E 

bi~\ -b bj—J 


6i! 


' W E..i 2‘ 


lH-b&i - 4 


(4.8) 


Somewhat surprisingly, the right hand side here can be evaluated explicitly using the “cycle 
lemma”, as in jForOSbj . The key trick is to add an additional averaging over the J cyclic 
permutations of b \,..., bj to the inner summation. 


Lemma 4.2. Let X\,xj be positive reals such that x\ ■ ■ - xj = 1. Then the average of 
( Yli =i x i ■ ■ ■ x i ) over cyclic permutations of x\ ,..., x j is exactly 1 / J. 


Proof. Reading indices modulo J we have 


j 

E 


i x t +1 ■ ■ ■ x t+i 


X\ ■■■Xt 


Ew . 

t =1 Z^i=l X 1 x t+i 


= 1. 


□ 


Applying the cycle lemma with ay = 2 bi 1 gives (noting that cyclic permutation of the 
variables is a 1-1 map on the set of (&!,..., bj) with b\ + • • • + bj — J) that 


, E . 

b\-\ -b bj—J 


1 

b\\- ■ - bj\ J2i=i 2 bl+ ' +bi ~ i 



bi~\ -b bj — J 



l J j 

1' jF’ 


the second equality being a consequence of the multinomial theorem. 

Substituting into (14.811 . and recalling that J = + 0(1), the lower bound in Proposi¬ 

tion 11.41 now follows from Stirling’s formula. 
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5. The upper bound in Proposition 11.41 
In this section we turn to the upper bound in Proposition 11.41 that is to say the bound 

E|JSf(X fc )| Ck a (logk)- 3/2 . 

As with the lower bound, we condition on the number of cycles of length at most k in a 
random permutation. Recall from (14.3p the definition of Jz?*(a): 


& *( a ) = {J2 ai: 1 c m}- 

From (13.ip . (13.2p and (14.2p we have 


iei 


ei miKTj E 


|J?*(a)| 


k ' r\ x —' ai • • • a r 

r ai,...,a r =l 


(5.1) 


The most common way for |2z?*(a)| to be small is when there are many of the cq which are 
small. To capture this, let ai,a 2 ,... be the increasing rearrangement of the sequence a, so 
that di ^ a 2 ^ . For any j satisfying 0 ^ j ^ r, we have 

2z?*(a) C <m + E ^ : 0 ^ m ^ X> I C{j + l,...,r} 

t i£l i=1 

from which it follows immediately that 

|JSf(a)| <G(a), 

where 


G( a) = min 2 r J (cq + • • • + a,- + 1) 

O^j^r 


(5.2) 


It is reasonable to expect that 


E 


CL\ j• • • j dj* — 1 


G(b) 

a\ - ■ ■ a r 


G( t) 


’1 /T 


4 4 


1 1 * tl 


■dt = (log fc) r 


G(e^ logfc ,..., e^ rloek )d£, (5.3) 


'0 jo 


where we have enlarged the domain of G to include r-tuples of positive real numbers. How¬ 
ever, G is not an especially regular function and so (15.3p is perhaps too much to hope for. 
The function G is, however, increasing in every coordinate and we may exploit this to prove 
an approximate version of (15. 3 H . 

Lemma 5.1. For any r ^ 1, we have 


E I^«(2/u.)V! 


ai• • • a r 


dp ^^ CLj' — 1 

where = {(£i,..., £ r ) : 0 ^ ^ £ 2 ^ ^ 6- ^ !}• 


‘ min 2 j (k +-h k^ j + 1 )d£, 
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Proof. Motivated by the fact that 1/a = dt/t, define the product sets 

r 

R (&) = n t exp ’ eX P (M] • 


i=l 


By (15,21) . we have 


l-^*( a )l < C( a ) 

/_—/ n. ■ ■ ■ n ' 


Gy / ^ Gy 

CL\j'**)Cbj' — 1 CL\ , • • • ^CL'p — X GL]_ 


E G(a)/ ^ 

a r = 1 B ' ' ' 




Consider some t G R( a). Writing ti ^ t 2 ^ ... ^ t r for the non-decreasing rearrangement 
of t, we have 

exp (h^- 1 ) E U if exp (h 5i ) for 1 ^ i ^ r. 

From (13. 2 p we see that t t ^ a,: for all z. Hence 

G(a) ^ min 2 r_J (fi + • • • + tj + 1) = G(t) for all t e /2(a). 

O^j^r 


This yields 

k 


E C(a)/ t J ^-r-i 

...,ar=l Jr ( a)H H ai « r= 1 V) B ^ ^ Pi 


fexp(/i fe ) 


g(t) 

^ * ~fc y 


dt. 


d\ ,... ,ttr— X ' ' — X 

The integrand on the right is symmetric inti,... ,t r . Making the change of variables L = 
yields 


E 


i-g’wi 

ay • • ■ a r 


^ (2 hk) r r\ / min 2 J (e^ l/lfc + • • • + e^' hfc + l) d£. 


In r °^r 


ai,...,a r =l 

The lemma follows from the upper bound in (13.21) . namely hk ^ 1 + log/c. 


□ 


With Lemma PTTl established. we may conclude the proof of the upper bound in Proposition 
11.41 bv quoting [Ford8b 1, Lemma 3.6]. Indeed, in the notation of that paper 

/ min 2~ j ( k^ +-f- kf :i + X) — U r ( log 2 k), 

Jn r 


and thus by (15.11) and Lemma [5.II we have 

E|JSf(X fc )| « ^(2/i fc ) r £/ r (log 2 A:). 

r 

Now [For08bi Lemma 3.6] provides the bound 

1 + | log 2 k — r| 2 


(5.4) 


f/ r (log 2 k) < 


(r + l)!( 2 r-lo & 2 fc + 1)’ 


uniformly for 0 ^ r ^ 10 log 2 k . Set 


r * = Llog 2 k\. 
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In what follows, we will use the observation that a 71 /(n + 1)! is increasing for n ^ a — 2 and 
decreasing thereafter. If r = r* + m with m ^ 9 log 2 k , m G then we have 


(2h k yu r (io g2 k ) < 


'ihi 


•C 


(r + 1)! 


[r* 

T +- 1 


1 )! 


l+log log 2 

•C fc + lo s 2 (log A:) 


1 + m 2 
2 m 

1 + m 2 

'IF" 

-3/2 1 + m 2 

(i) m ' 


In the first step we used the observation (and the fact that | and in the second step 

we used Stirling’s formula and (13.2j) . Summed over m, this is of course rapidly convergent 
and shows that the contribution to (15.4j) from this range of r is acceptable. 

Next suppose that r = r* — m, m E N. Then we have 


(2h k ) r U r {\og 2 k) « 
< 


(I h k y 

(r + 1)! 

(I h k y* 

(r* + 1)! 



•C k 


i+ 


l+log log 2 
log 2 


(log k) 


-3/2 


1 + m 2 


Here, we used the observation (and the fact that | > ^h_) and a second application of Stir¬ 
ling’s formula. Summed over m, this is once again rapidly convergent and the contribution 
to (15.41) from this range of r is acceptable. 

There remains the range r > 101og 2 fc. Here, we use the trivial bound U r (log 2 k ) ^ 1/r! 
and thus 

^ (2h fc ) r H r (log 2 k) « ^ 

r> 101og 2 /c r>101og 2 /c 

which is obviously minuscule in comparison to the other terms. 

Remarks. It is obvious from this analysis and the lower bound in our main theorem that 
a proportion ^ 1 — £ of all permutations fixing some set of size k have log 2 k + 0(log(l/e)) 
cycles of length at most k. It is most probably also true that for a proportion ^ 1 — £ of all 
permutations fixing some set of size k we have log dj ^ j log2 — O e ( 1) for j ^ log 2 k — O e (l), 
where the aj are the (ordered) cycle lengths of the permutation. To establish this would 
require opening up some of the arguments used to bound the quantities U k in [For08b] . We 
plan to return to this and other issues in a future paper. 
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